CTGAGCTGACCTGACCCCCAAAGTGAAGGAGAAGCTGCAAGGGAAAAGGGAGGGACAGAT 

1 + + + + + + 60

GACTCGACTGGACTGGGGGTTTCACTTCCTCTTCGACGTTCCCTTTTCCCTCCCTGTCTA 

CAGGGAGACCGGGGAAGAAGGAGGAGCAGCCAAGGAGGCTGCTGTCCCCCCACAGAGCAG 

61 + + + + + + 120 

GTCCCTCTGGCCCCTTCTTCCTCCTCGTCGGTTCCTCCGACGACAGGGGGGTGTCTCGTC 

CTCGGACTCAGCTCCCGGAGCAACCCAGCTGCGGAGGCAACGGCAGTGCTGCTCCTCCAG 

121 + + + + : + + 180 

GAGCCTGAGTCGAGGGCCTCGTTGGGTCGACGCCTCCGTTGCCGTCACGACGAGGAGGTC 

CGAAGGACAGCAGGCAGGCAGACAGACAGAGGTCCTGGGACTGGAAGGCCTCAGCCCCCA 

181 + + + + + + 240 

GCTTCCTGTCGTCCGTCCGTCTGTCTGTCTCGAGGACCCTGACCTTCCGGAGTCGGGGGT 

GCCACTGGGCTGGGCCTGGCCCAATGGCCTTTAATGACCTCCTGCAGCAGGTGGGGGGTG

241 + + + + + + 300 

CGGTGACCCGACCCGGACCGGGTTACCGGAAATTACTGGAGGACGTCGTCCACCCCCCAC 

MAFNDLLQQVGGV-

TCGGCCGCTTCCAGCAGATCCAGGTCACCCTGGTGGTCCTCCCCCTGCTCCTGATGGCTT 
301 + + + + + + 360

AGCCGGCGAAGGTCGTCTAGGTCCAGTGGGACCACCAGGAGGGGGACGAGGACTACCGAA 

GRFQQ IQVTLV -VL PLLLMAS- 

CTCACAACACCCTGCAGAACTTCACTGCTGCCATCCCTACCCACCACTGCCGCCCGCCTG 

361 + + + + : + + 420 

GAGTGTTGTGGGACGTCTTGAAGTGACGACGGTAGGGATGGGTGGTGACGGCGGGCGGAC 

HNTLQNFTAAIPTHHCRPPA-

CCGATGCCAACCTCAGCAAGAACGGGGGGCTGGAGGTCTGGCTGCCCCGGGACAGGCAGG

421 + + + + + + 480 

GGCTACGGTTGGAGTCGTTCTTGCCCCCCGACCTCCAGACCGACGGGGCCCTGTCCGTCC 

D AN L S KNGGL EVWL P RDRQG- 

GGCAGCCTGAGTCCTGCCTCCGCTTCACCTCCCCGCAGTGGGGACTGCCCTTTCTCAATG 

481 + + + + + + 540

CCGTCGGACTCAGGACGGAGGCGAAGTGGAGGGGCGTCACCCCTGACGGGAAAGAGTTAC 

QPES CLRFTS PQWGL PFLNG- 

GCACAGAAGCCAATGGCACAGGGGCCACAGAGCCCTGCACCGATGGCTGGATCTATGACA 
541 + + + + + + 600

CGTGTCTTCGGTTACCGTGTCCCCGGTGTCTCGGGACGTGGCTACCGACCTAGATACTGT

TEANGTGATEPCTDGWIYDN-

ACAGCACCTTCCCATCTACCATCGTGACTGAGTGGGACCTTGTGTGCTCTCACAGGGCCC

601 + + + + + + 660 
TGTCGTGGAAGGGTAGATGGTAGCACTGACTCACCCTGGAACACACGAGAGTGTCCCGGG

STFPSTIVTEWDLVCSHRAL- 

TACGCCAGCTGGCCCAGTCCTTGTACATGGTGGGGGTGCTGCTCGGAGCCATGGTGTTCG 

661 + + + + + + 720 

ATGCGGTCGACCGGGTCAGGAACATGTACCACCCCCACGACGAGCCTCGGTACCACAAGC 

RQLAQSLYMVGVLLGAMVFG-

GCTACCTTGCAGACAGGCTAGGCCGCCGGAAGGTACTCATCTTGAACTACCTGCAGACAG 

721 + + + + + ---+ 780 

CGATGGAACGTCTGTCCGATCCGGCGGCCTTCCATGAGTAGAACTTGATGGACGTCTGTC 

YLADRLG RRK VLI LN YL QTA- 

CTGTGTCAGGGACCTGCGCAGCCTTCGCACCCAACTTCCCCATCTACTGCGCCTTCCGGC 

781 + + + + + + 840 

GACACAGTCCCTGGACGCGTCGGAAGCGTGGGTTGAAGGGGTAGATGACGCGGAAGGCCG 

VSGTGAAFAPNFPIY- C A F R L - 

TCCTCTCGGGCATGGCTCTGGCTGGCATCTCCCTCAACTGCATGACACTGAATGTGGAGT 

841 + + + + + + 900

AGGAGAGCCCGTACCGAGACCGACCGTAGAGGGAGTTGACGTACTGTGACTTACACCTCA 

L S GMALAG I S LNCM T L N V E W - 

GGATGCCCATTCACACACGGGCCTGCGTGGGCACCTTGATTGGCTATGTCTACAGCCTGG 

901 + + + + + + 960

CCTACGGGTAAGTGTGTGCCCGGACGCACCCGTGGAACTAACCGATACAGATGTCGGACC

MP I HTRACVGTLIGYVYS LG- 

GCCAGTTCCTCCTGGCTGGTGTGGCCTACGCTGTGCCCCACTGGCGCCACCTGCAGCTAC 

961 + + + + + + 1020 

CGGTCAAGGAGGACCGACCACACCGGATGCGACACGGGGTGACCGCGGTGGACGTCGATG

QFLLAGVAYAVPHWRHLQLL-

TGGTCTCTGCGCCTTTTTTTGCCTTCTTCATCTACTCCTGGTTCTTCATTGAGTCGGCCC 

1021 + + + + + + 1080 

ACCAGAGACGCGGAAAAAAACGGAAGAAGTAGATGAGGACCAAGAAGTAACTCAGCCGGG 

VSAPFFAFFIYSWFFIESAR-

GCTGGCACTCCTCCTCCGGGAGGCTGGACCTCACCCTGAGGGCCCTGCAGAGAGTCGCCC 

1081 + . + + + + + 1140 

CGACCGTGAGGAGGAGGCCCTCCGACCTGGAGTGGGACTCCCGGGACGTCTCTCAGCGGG 

WH S S SGRLDLTLRALQRVAR- 

GGATCAATGGGAAGCGGGAAGAAGGAGCCAAATTGAGTATGGAGGTACTCCGGGCCAGTC 
1141 + + + + + + 1200
CCTAGTTACCCTTCGCCCTTCTTCCTCGGTTTAACTCATACCTCCATGAGGCCCGGTCAG

INGKREEG AKLSMEVLRASL- 

TGCAGAAGGAGCTGACCATGGGCAAAGGCCAGGCATCGGCCATGGAGCTGCTGCGCTGCC

1201 + + + + - + + 1260 

ACGTCTTCCTCGACTGGTACCCGTTTCCGGTCCGTAGCCGGTACCTCGACGACGCGACGG 

QKELTMGKGQASAMELLRCP- 

CCACCCTCCGCCACCTCTTCCTCTGCCTCTCCATGCTGTGGTTTGCCACTAGCTTTGCAT 

1261 + + + + + + 1320

GGTGGGAGGCGGTGGAGAAGGAGACGGAGAGGTACGACACCAAACGGTGATCGAAACGTA

TLRHLFLCLSMLWFATSFAY-

ACTATGGGCTGGTCATGGACCTGCAGGGCTTTGGAGTCAGCATCTACCTAATCCAGGTGA 

1321 + + + + + + 1380 

TGATACCCGACCAGTACCTGGACGTCCCGAAACCTCAGTCGTAGATGGATTAGGTCCACT 

YGLVMDLQGFGVSIYLIQVI-

TCTTTGGTGCTGTGGACCTGCCTGCCAAGCTTGTGGGCTTCCTTGTCATCAACTCCCTGG 

1381 + + + + + + 1440 

AGAAACCACGACACCTGGACGGACGGTTCGAACACCCGAAGGAACAGTAGTTGAGGGACC 

FGAVDLPAKLVGFLVINSLG-

GTCGCCGGCCTGCCCAGATGGCTGCACTGCTGCTGGCAGGCATCTGCATCCTGCTCAATG 

1441 + + + + + + 1500 

CAGCGGCCGGACGGGTCTACCGACGTGACGACGACCGTCCGTAGACGTAGGACGAGTTAC 

RR PAQMAALLLAG I C I LLNG- 

GGGTGATACCCCAGGACCAGTCCATTGTCCGAACCTCTCTTGCTGTGCTGGGGAAGGGTT 

1501 + + + + + + 1560

CCCACTATGGGGTGCTGGTCAGGTAACAGGCTTGGAGAGAACGACACGACCCCTTCCCAA 

VIPQDQSIVRTSLAVLGKGC-

GTCTGGCTGCCTCCTTCAACTGCATCTTCCTGTATACTGGGGAACTGTATCCCACAATGA 

1561 + + + + + + 1620

CAGACCGACGGAGGAAGTTGACGTAGAAGGACATATGACCCCTTGACATAGGGTGTTACT 

LAASFNC IFLYTGELYPTMI- 

TCCGGCAGACAGGCATGGGAATGGGCAGCACCATGGCCCGAGTGGGCAGCATCGTGAGCC 

1621 + + + ---- + ---- + + 1680 

AGGCCGTCTGTCCGTACCCTTACCCGTCGTGGTACCGGGCTCACCCGTCGTAGCACTCGG 

RQTGMGMGSTMARVGS IVS P - 

CACTGGTGAGCATGACTGCCGAGCTCTACCCCTCCATGCCTCTCTTCATCTACGGTGCTG 
1681 + + + --- + - + + 1740 
GTGACCACTCGTACTGACGGCTCGAGATGGGGAGGTACGGAGAGAAGTAGATGCCACGAC
LVSMTAELYPSM PLF I Y G A V - 

TTCCTGTGGCCGCCAGCGCTGTCACTGTCCTCCTGCCAGAGACCCTGGGCCAGCCACTGC
1741 --. + + + + + + 1800 

AAGGACACCGGCGGTCGCGACAGTGACAGGAGGACGGTCTCTGGGACCCGGTCGGTGACG 

PVAASAVTVLLPETLGQPLP-
CAGACACGGTGCAGGACCTGGAGAGCAGGAAAGGGAAACAGACGCGACAGCAACAAGAGC 

1801 + + + + + + 1860

GTCTGTGCCACGTCCTGGACCTCTCGTCCTTTCCCTTTGTCTGCGCTGTCGTTGTTCTCG 

DTVQDLESR KGKQT RQQQ EH- 

ACCAGAAGTATATGGTCCCACTGCAGGCCTCAGCACAAGAGAAGAATGGACTCTGAGGAC 

1861 + - + + + + + 1920 

TGGTCTTCATATACCAGGGTGACGTCCGGAGTCGTGTTCTCTTCTTACCTGAGACTCCTG

QKY MV PLQASAQEK NGL* 

TGAGAAGGGGCCTTACAGAACCCTAAAGGGAGGGAAGGTCCTACAGGTCTCCGGCCACCC 

1921 + + + + + + 1980 

ACTCTTCCCCGGAATGTCTTGGGATTTCCCTCCCTTCCAGGATGTCCAGAGGCCGGTGGG 

ACACAAGGAGGAGGAAGAGGAAATGGTGACCCAAGTGTGGGGGTTGTGGTTCAGGAAAGC 

1981 + + + + + + 2040

TGTGTTCCTCCTCCTTCTCCTTTACCACTGGGTTCACACCCCCAACACCAAGTCCTTTCG 

ATCTTCCCAGGGGTCCACCTCCCTTTATAAACCCCACCAGAACCACATCATTAAAAGGTT 

2041 + + + + + + 2100 

TAGAAGGGTCCCCAGGTGGAGGGAAATATTTGGGGTGGTCTTGGTGTAGTAATTTTCCAA 

TGACTGCGCACCAAAAAAAAAAAAAAA 

2101 - + + 2127 

ACTGACGCGTGGTTTTTTTTTTTTTTT 
CTGCACCTGAAGCATTTGGTGGGTGAGCAGCATGGGCTTTGAGGAGCTGCTGGAGCAGGT
1 + + + + + + 60 

GACGTGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV- 

GGGCGGCTTTGGGCCCTTCCAACTGCGGAATGTGGCACTGCTGGCCCTGCCCCGAGTGCT

61 + + + + + + 120

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGF GPFQLRNVALLALPRVL- 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCGTGCCTGCCCACCGATGTGC 

121 + + + + + + 180 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGGACGGACGGGTGGCTACACG 

L PLHFLL P I FLAA V PAH RCA 

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 

181 + + + --- + + + 240 

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

L P GA PAN F S H Q DVW L EA H L P 

CCGGGAGCCTGATGGCACGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 

241 + + + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 

REPDGTLSSCLRFAYPQALP- 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTGCCAC 

301 + + + + + + 360 

GTTGTGGTGCAACCCCCTTCTTTCCGTCTCGGCACCCCTCGACCTCCTACTTGGACGGTG 

NTTLGEERQSRGELEDEPAT-

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC
361 + + + + + + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSQGWEYDHSEFSSTIAT- 

TGAGTGGGATCTGGTGTGTGAGCAGAAAGGTCTGAACAGAGCTGCGTCCACTTTCTTCTT 

421 + + + + + + 480 

ACTCACCCTAGACCACACACTCGTCTTTCCAGACTTGTCTCGACGCAGGTGAAAGAAGAA 

EWDLVCEQKGLNRAASTFFF-

CGCCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTTTGGGCGGCG 

481 + + + --- + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC 

AGVLVGAVAFGYLS DRFGRR 
GCGTCTGCTGCTGGTAGCCTACGTGAGTACCCTGGTGCTGGGCCTGGCATCTGCAGCCTC

541 + + + + + + 600 

CGCAGACGACGACCATCGGATGCACTCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

RLLL VAYVSTLVLGLASA AS 

CGTCAGCTATGTAATGTTTGCCATCACCCGCACCCTTACTGGCTCAGCCCTGGCTGGTTT 
601 + + + + + + 660

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGAGCAAA 

VSYVMFAITRTLTGSALAGF-

TACCATCATCGTGATGCCACTGGAGCTGGAGTGGCTGGATGTGGAGCACCGCACCGTGGC 

661 + + --- + + + + 720 

ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 

TIIVMPLELEWLDVEHRTVA-

TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 

721 + + + + + + 780 

ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACCGTGACCAACCCAT 

GVLSSTFWTGGVMLLALVGY

CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 
781 + + + - + + + 840 

GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

LIRDWRWLLLAVTLPCAPSI-

CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTGGCTTCTGACCCAAGGCCATGTGAA

841 + + + + + + 900 

GGAGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

LSLWWVPESARWLLTQGHVK

AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA

901 + + + + + + 960 

TCTCCGGGTGTCCATGAACGAGGTGACACGGTCCGAGTTACCCGCCGGTCACACACTCCT 

EAHRYLLHCARL NG RPVCED- 

CAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTCCGAAGACC

961 + + + + + + 1020 

GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTTGCCCACCAGGCTTCTGG 

SFSQEAVSKVAAGERVVRRP- 

TTCATACCTAGACCTGTTCCGCACACCACGGCTCCGACACATCTCACTGTGCTGCGTGGT

1021 + + + + + + 1080 

AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTGTAGAGTGACACGACGCACCA 

SYLDLFRTPRLRHI SLCCVV- 
GGTGTGGTTCGGAGTGAACTTCTCCTATTACGGCCTGAGTCTGGATGTGTCGGGGCTGGG
1081 + + + + + + 1140

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG- 

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGAACTGCCCTCCAAGCTGCT 

1141 + + + + + + 1200

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTTCGACGA 

LNVYQTQLLFGAV E L P S KLL 

GGTCTACTTGTCGGTGCGCTACGCAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG

1201 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYLSVRYAGRRLTQAGTLLG-

CACGGCCCTGGCGTTCGGCACTAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC

1261 + + + + + + 1320

GTGCCGGGACCGCAAGCCGTGATCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG 

TALAFGTRLLVSSDMKSWST

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCACCACTGCCTACCTGTT 

1321 + + + + + + 1380

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGAAGTGGTGACGGATGGACAA 

VLAVMGKAFSEAAFTTAYLF-

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT

1381 + + + - + + --- + 1440 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

T S ELY PT VLR QTGMG L T A L V - 

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC 

1441 + + + -- + + + 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 

GRLGGSLAPLAALLDGVWLS 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

1501 + + + + + + 1560

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

LPKLTYGGIALLAAGTALLL- 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGTGC 

1561 + + + + + + 1620

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGGTAGGTCCTGCACCTCTCTTTCTCACG 

PETRQAQLPETIQDVERKSA- 
CCCAACCAGTCTTCAGGAGGAAGAGATGCCCATGAAGCAGGTCCAGAACTAAGTGGGAGT

1621 + + + + -- --- + + 1680 

GGGTTGGTCAGAAGTCCTCCTTCTCTACGGGTACTTCGTCCAGGTCTTGATTCACCCTCA 

PTSLQEE EMPMKQVQN* 

GGAGGCAGGCCCTCCACAGAAGCTCTGCAGCAGGGGCTGGGAGAGCAGAAGGGCAGGCCC 

1681 + + + + + + 1740 

CCTCCGTCCGGGAGGTGTCTTCGAGACGTCGTCCCCGACCCTCTCGTCTTCCCGTCCGGG 

TGCAACTCAGGCTGGGAGTATCGAACCCTCTGCCTAGGGCCGGAGTTGCTGCCAGTACCC 

1741 -+ + + + + + 1800 

ACGTTGAGTCCGACCCTCATAGCTTGGGAGACGGATCCCGGCCTCAACGACGGTCATGGG 

GCTCCCTCTGCTCATCCATCCTTGATTATTTGGCTTCTAGGAACAGTTGACTTCCCAGAA 

1801 + + + + + + 1860 

CGAGGGAGACGAGTAGGTAGGAACTAATAAACCGAAGATCCTTGTCAACTGAAGGGTCTT 

TGCAGTGGGCTGCTGGGCACCCCTCTCACGGTTGGGGAGGATTCTGTAAATAAAGGTGCC 

1861 + + + + + + 1920

ACGTCACCCGACGACCCGTGGGGAGAGTGCCAACCCCTCCTAAGACATTTATTTCCACGG 

CCTTGGGTTGGGGCAATGGTGACGAGCTGTGGGAAGAGCCCTGGATAGGAAGCCACTGAG

1921 + + + + -- + + 1980 

GGAACCCAACCCCGTTACCACTGCTCGACACCCTTCTCGGGACCTATCCTTCGGTGACTC 

TCTGCCCTGGGCTCTGATAAAACCTTCACCATTAACTTGCTGTGTGACCTTGGGCATGTG

1981 + + + + + + 2040

AGACGGGACCCGAGACTATTTTGGAAGTGGTAATTGAACGACACACTGGAACCCGTACAC 

GCTTTCCCTCTCTGGCCTCAGTCTGTTCATCTCCCAAATGGATAATGAAGCCTCTTGGGA 

2041 + + + + + + 2100 

CGAAAGGGAGAGACCGGAGTCAGACAAGTAGAGGGTTTACCTATTACTTCGGAGAACCCT 

GGCCCTACCATAGGATCTGTTGCCATGCTCAAATGAGTTACTGAATAAGGTGCTTCTGCT 

2101 + + + + + + 2160

CCGGGATGGTATCCTAGACAACGGTAGGAGTTTACTCAATGACTTATTCCACGAAGACGA

TCTTCTAGAGATGGTGCTAAAGAAAGGACTAGCATATGAGAGTTCTGGTACCAATGGGGC 

2161 + + + + + + 2220 

AGAAGATCTCTACCACGATTTCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCG 

TGGTGGGCATGCTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAA 

2221 + + --- + + + + 2280 

ACCACCCGTACGACAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTT 

AAAGAACAGAGCTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGG 

2281 + + + + - + + 2340 

TTTCTTGTCTCGAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCC 

CAATGCATGTCATCCCAACCCCCACACTCCCCATCCTCCAACCCACTGGTCTCATGCCCA 

2341 + + + + + + 2400

GTTACGTACAGTAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGT 
AAGAAGAGTTGAAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATT

2401 + + + + + + 2460 

TTCTTCTCAACTTCCGTACCCTCGGTTGTAAAATAACTTCTTCGGTGTCTCCGACTTTAA

CAATAAACACAAGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

2461 + + + + - 2501 

GTTATTTGTGTTCAAAATACTCATTTTTTTTTTTTTTTTTT 
CTGCACCTGAAGCATTTGGTGGGTGAGCAGCATGGGCTTTGAGGAGCTGCTGGAGCAGGT

1 + + + + + + 60

GACGTGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV-

GGGCGGCTTTGGGCCCTTCCAACTGCGGAATGTGGCACTGCTGGCCCTGCCCCGAGTGCT 

61 + + + + + + 120

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGFGPFQLRNVALLALP RVL- 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCGTGCCTGCCCACCGATGTGC 

121 + + + + + + 180 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGCACGGACGGGTGGCTACACG 

LPLHFLLPIFLAAVPAHRCA-

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 
181 + + + + + + 240

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

LPGAPANFSHQDV WLEAHLP- 

CCGGGAGCCTGATGGCACGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 
241 + + - + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 

REPDGTLSSCLRFAYPQALP- 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTGCCAC 
301 + + + + + + 360

GTTGTGGTGCAACCCCCTTCTTTCCGTCTCGGCACCCCTCGACCTCCTACTTGGACGGTG

NTTLGEERQSRGELEDEPAT-

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC 

361 + + + + + + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSQGWEYDHSEFSSTIAT

TGAGTGGGATCTGGTGTGTGAGCAGAAAGGTCTGAACAGAGCTGCGTCCACTTTCTTCTT 

421 + + + + . + + 480 

ACTCAGCCTAGACCACACACTCGTCTTTCCAGACTTGTCTCGACGCAGGTGAAAGAAGAA 

EWDLVCEQKGLNRAASTFFF-

CGCCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTTTGGGCGGCG
481 + + + + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC 
AGVLVGA VAFGYL S DRFGRR 
GCGTCTGCTGCTGGTAGCCTACGTGAGTACCCTGGTGCTGGGCCTGGCATCTGCAGCCTC
541 + + + + + + 600

CGCAGACGACGACCATCGGATGCACTCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

RLLLVAYVSTLVLG L A SA A S - 

CGTCAGCTATGTAATGTTTGCCATCACCCGCACCCTTACTGGCTCAGCCCTGGCTGGTTT 

601 + + + + + + 660

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGACCAAA 

VSYVMFAITRTLTGSALAGF-

TACCATCATCGTGATGCCACTGGAGCTGGAGTGGCTGGATGTGGAGCACCGCACCGTGGC 
661 + + + + + + 720

ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 

TI IVMPLELEWLD VEHRTVA- 

TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 
721 + + + + + + 780

ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACGGTGACCAACCCAT 
GVLSSTFWTGGVMLLALVGY-

CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 
781 + + + + + + 840 

GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

L IRDWRWLLLAVTL PCAP S I - 

CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTGGCTTCTGACCCAAGGCCATGTGAA 

841 + + + + + + 900 

GGAGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

LSLWWVPESARWLLTQGHVK-

AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA 
901 + + + + + + 960

TCTCCGGGTGTCCATGAACGAGGTGACACGGTCCGAGTTACCCGCCGGTCACACACTCCT 

EAHRYLLHCARLNGRPVCED

CAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTCCGAAGACC 

961 + + + + + + 1020

GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTTGCCCACCAGGCTTCTGG

SFSQEAVSKVAAGERVVRRP-

TTCATACCTAGACCTGTTCCGCACACCACGGCTCCGACACATCTCACTGTGCTGCGTGGT 

1021 + + + + + + 1080 

AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTGTAGAGTGACACGACGCACCA 

SYLDLFR TPRLRHISLCCVV- 
GGTGTGGTTCGGAGTGAACTTCTCCTATTACGGCCTGAGTCTGGATGTGTCGGGGCTGGG

1081 + + + + + + 1140

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG-

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGAACTGCCCTCCAAGCTGCT 

1141 + + + + + + 1200

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTTCGACGA 

LNVYQTQLLFGA VELPSKLL- 

GGTCTACTTGTCGGTGCGCTACGCAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG 

1201 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYLSVRYAGRRLTQAGTLLG

CACGGCCCTGGCGTTCGGCACTAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC 

1261 + + + + -- + + 1320 

GTGCCGGGACCGCAAGCCGTGATCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG

TALAFGTRLLVSSDMKSWST

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCACCACTGCCTACCTGTT 

1321 + + + + + + 1380

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGAAGTGGTGACGGATGGACAA 

VLAVMGKAFSEAAFTTAYLF-

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT 

1381 + - + --- + + + + 1440 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

TSELYPTVLRQTGMGLTALV-

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC
1441 + + + + + + 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 

GRLGGSLAPLAALLDGVWLS 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

1501 + + + + + - + 1560 

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

L P K L T YGG I A L LA AG T A L L L 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGAGA 

1561 + + + + + + 1620

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGGTAGGTCCTGCACCTCTCTTTCTCTCT 

PETRQAQLPETIQDVERKRD- 
TGGTGCTAAAGAAAGGACTAGCATATGAGACTTCTGGTACCAATGGGGCTGGTGGGCATG



1621 + + + + + + 1680 

ACCACGATTTCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCGACCACCCGTAC 

G AKERT SI * 

CTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAAAAAGAACAGAG 

1681 + + + + + + 1740 

GACAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTTTTTCTTGTCTC 

CTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGGCAATGCATGTC 

1741 + + + + + + 1800 

GAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCCGTTACGTACAG 

ATCCCAACCCCCACACTCCCCATCCTCCAACCCACTGGTCTCATGCCCAAAGAAGAGTTG 

1801 + + + + + + 1860 

TAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGTTTCTTCTCAAC

AAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATTCAATAAACACA 

1861 + + + + + + 1920

TTCCGTACCCTCGGTTGTAAAATAACTTCTTCGGTGTCTCCGACTTTAAGTTATTTGTGT 

AGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

1921 + + + 1950 

TCAAAATACTCATTTTTTTTTTTTTTTTTT 
GCAGGGACCTCAACTACACTGATCACCAGCCCCATCGGATCCAGACCCGGCCACCAGTGC

1 + + + + + + 60

CGTCCCTGGAGTTGATGTGACTAGTGGTCGGGGTAGCCTAGGTCTGGGCCGGTGGTCACG 

CATGACCTTCTCGGAGATCCTGGACCGTGTGGGAAGCATGGGCCATTTCCAGTTCCTGCA 

61 + + + + + + 120

GTACTGGAAGAGCCTCTAGGACCTGGCACACCCTTCGTACCCGGTAAAGGTCAAGGACGT 

MTFSEI LDRVGSMGHFQFLH 

TGTAGCCATACTGGGCCTCCCGATCCTCAACATGGCCAACCACAACCTGCTGCAGATCTT 

121 + + + + + -- + 180 

ACATCGGTATGACCCGGAGGGCTAGGAGTTGTACCGGTTGGTGTTGGACGACGTCTAGAA 

VAILGLPIL NMANHNLLQIF- 

CACAGCCGCCACCCCTGTCCACCACTGTCGCCCGCCCCACAATGCCTCCACAGGGCCTTG 
181 + + -+ + + + 240 

GTGTCGGCGGTGGGGACAGGTGGTGACAGCGGGCGGGGTGTTACGGAGGTGTCCCGGAAC 

TAATPVHHCRPPHNASTGPW

GGTGCTCCCCATGGGCCCAAATGGGAAGCCTGAGAGGTGCCTCCGTTTTGTACATCCGCC

241 + --- + + + + - + 300 

CCACGAGGGGTACCCGGGTTTACCCTTCGGACTCTCCACGGAGGCAAAACATGTAGGCGG 

VLPMGPNGKPERC LRFVHPP- 

CAATGCCAGCCTGCCCAATGACACCCAGAGGGCCATGGAGCCATGCCTGGATGGCTGGGT 

301 + + + + + + 360

GTTACGGTCGGACGGGTTACTGTGGGTCTCCCGGTACCTCGGTACGGACCTACCGACCCA 

NASLPNDTQ RAMEPCLD GWV- 

CTACAACAGCACCAAGGACTCCATTGTGACAGAGTGGGACTTGGTGTGCAACTCCAACAA 

361 + + + + + : + 420 

GATGTTGTCGTGGTTCCTGAGGTAACACTGTCTCACCCTGAACCACACGTTGAGGTTGTT 

YNSTKDS IVTEWDLVCNSNK- 

ACTGAAGGAGATGGCCCAGTCTATCTTCATGGCAGGTATACTGATTGGAGGGCTCGTGCT 

421 + + + + + + 480 

TGACTTCCTCTACCGGGTCAGATAGAAGTACCGTCCATATGACTAACCTCCCGAGCACGA 

LKEMAQSIFMAGILIGGLVL-

TGGAGACCTGTCTGACAGGTTTGGCCGCAGGCCCATCCTGACCTGCAGCTACCTGCTGCT 

481 + + + + + + 540

ACCTCTGGACAGACTGTCCAAACCGGCGTCCGGGTAGGACTGGACGTCGATGGACGACGA 

GDLSDRFGRRPILTCSYLLL-

GGCAGCCAGCGGCTCCGGTGCAGCCTTCAGCCCCACCTTCCCCATCTACATGGTCTTCCG 
541 + + + + + + 600
CCGTCGGTCGCCGAGGCCACGTCGGAAGTCGGGGTGGAAGGGGTAGATGTACCAGAAGGC
AASG S G A A F S PTF P IYMVFR - 

CTTCCTGTGTGGCTTTGGCATCTCAGGCATTACCCTGAGCACCGTCATCTTGAATGTGGA

601 - + + + + + + 660 

GAAGGACACACCGAAACCGTAGAGTCCGTAATGGGACTCGTGGCAGTAGAACTTACACCT 

FLCGFGI SGITLSTVILNVE- 

ATGGGTGCCTACCCGGATGCGGGCCATCATGTCGACAGCACTCGGGTACTGCTACACCTT 

661 + + + + + + 720

TACCCACGGATGGGCCTACGCCCGGTAGTACAGCTGTCGTGAGCCCATGACGATGTGGAA 

WVPTRMRAIMSTA LGYCYTF- 

TGGCCAGTTCATTCTGCCCGGCCTGGCCTACGCCATCCCCCAGTGGCGTTGGCTGCAGTT 

721 + + -- + + + + 780 

ACCGGTCAAGTAAGACGGGCCGGACCGGATGCGGTAGGGGGTCACCGCAACCGACGTCAA 

GQFILPGLAYAIPQWRWLQL-

AACTGTGTCCATTCCCTTCTTCGTCTTCTTCCTATCATCCTGGTGGACACCAGAGTCCAT 

781 + + + + + + 840

TTGACACAGGTAAGGGAAGAAGCAGAAGAAGGATAGTAGGACCACCTGTGGTCTCAGGTA 

TVSI PFFV FFLSSWWTP ESI - 

ACGCTGGTTGGTCTTGTCTGGAAAGTCCTCGAAGGCCCTGAAGATACTCCGGCGGGTGGC 

841 + + + + + + 900

TGCGACCAACCAGAACAGACCTTTCAGGAGCTTCCGGGACTTCTATGAGGCCGCCCACCG 

RWLV LS GKSSKALK ILRRVA- 

TGTCTTCAATGGCAAGAAGGAAGAGGGAGAAAGGCTCAGCTTGGAGGAGCTCAAACTCAA 

901 + + + + + + 960

ACAGAAGTTACCGTTCTTCCTTCTCCCTCTTTCCGAGTCGAACCTCCTCGAGTTTGAGTT 

V FNG KKEEGERLSLEELKLN- 

CCTGCAGAAGGAGATCTCCTTGGCCAAGGCCAAGTACACCGCAAGTGACCTGTTCCGGAT

961 + + + + + + 1020 

GGACGTCTTCCTCTAGAGGAACCGGTTCCGGTTCATGTGGCGTTCACTGGACAAGGCCTA 

LQKEISLAKAKYTASDLFRI-

ACCCATGCTGCGCGGCATGACCTTCTGTCTTTCCCTGGCCTGGTTTGCTACCGGTTTTGC 

1021 + . + + + + + 1080 

TGGGTACGACGCGGCGTACTGGAAGACAGAAAGGGACCGGACCAAACGATGGCCAAAACG 

PMLRRMTFCLSLAWFATGFA-

CTACTATAGTTTGGCTATGGGTGTGGAAGAATTTGGAGTCAACCTCTACATCCTCCAGAT 
1081 + + + + + + 1140
GATGATATCAAACCGATACCCACACCTTCTTAAACCTCAGTTGGAGATGTAGGAGGTCTA

YYSLAMGVEEFGVNLYILQI-

CATCTTTGGTGGGGTCGATGTCCCAGCCAAGTTCATCACCATCCTCTCCTTAAGCTACCT 

1141 + + + - + ---- + + 1200 

GTAGAAACCACCCCAGCTACAGGGTCGGTTCAAGTAGTGGTAGGAGAGGAATTCGATGGA 

IFGGVDVPAKFITILSLSYL- 

GGGCCGGCATACCACTCAGGCCGCTGCCCTGCTCCTGGCAGGAGGGGCCATCTTGGCTCT 

1201 + + + + + + 1260

CCCGGCCGTATGGTGAGTCCGGCGACGGGACGAGGACCGTCCTCCCCGGTAGAACCGAGA 

GRHTTQAAALLLAGGAILAL-

CACCTTTGTGCCCTTGGACTTGCAGACCGTGAGGACAGTATTGGCTGTGTTTGGGAAGGG 

1261 + + + + + + 1320 

GTGGAAACACGGGAACCTGAACGTCTGGCACTCCTGTCATAACCGACACAAACCCTTCCC 

TFVPLDLQTVRTVLAVFGKG-

ATGCCTATCCAGCTCCTTGAGCTGCCTCTTCCTCTACACAAGTGAATTATACCCCACAGT 

1321 -: + + + + + + 1380 

TACGGATAGGTCGAGGAAGTCGACGGAGAAGGAGATGTGTTCACTTAATATGGGGTGTCA 

CLSSSFSCLFLYTSELYPTV- 

CATCAGGCAAACAGGTATGGGCGTAAGTAACCTGTGGACCCGCGTGGGAAGCATGGTGTC 

1381 + + + + + + 1440 

GTAGTCCGTTTGTCCATACCCGCATTCATTGGACACCTGGGCGCACCCTTCGTACCACAG 

IRQTGMGVSNLWTRVGSMVS-

CCCGCTGGTGAAAATCACGGGTGAGGTACAGCCCTTCATCCCCAATATCATCTACGGGAT 

1441 + + + + + + 1500

GGGCGACCACTTTTAGTGCCCACTCCATGTCGGGAAGTAGGGGTTATAGTAGATGCCCTA 

PLVKITGEVQPFIPNIIYGI

CACCGCCCTCCTCGGGGGCAGTGCTGCCCTCTTCCTGCCTGAGACCCTGAATCAGCCCTT 

1501 + + + ---- + + ---+ 1560 

GTGGCGGGAGGAGCCCCCGTCACGACGGGAGAAGGACGGACTCTGGGACTTAGTCGGGAA 

TALLGGSAALFLPE TLNQPL- 

GCCAGAGACTATCGAAGACCTGGAAAACTGGTCCCTGCGGGCAAAGAAGCCAAAGCAGGA 

1561 + . + + + + + 1620 

CGGTCTCTGATAGCTTCTGGACCTTTTGACCAGGGACGCCCGTTTCTTCGGTTTCGTCCT 

PET IEDLENWSLRAKKPKQE- 

GCCAGAGGTGGAAAAGGCCTCCCAGAGGATCCCTCTACAGCCTCACGGACCAGGCCTGGG 
1621 + + + + + + 1680
CGGTCTCCACCTTTTCCGGAGGGTCTCCTAGGGAGATGTCGGAGTGCCTGGTCCGGACCC

PEVEKASQRIPLQPHGPGLG- 

CTCCAGCTGAGGACAACGGAACCCCCTTTCCCTGCCCTCCAGAGACTGATCCTAGCCAGG 

1681 --- + + + --- + + + 1740 

GAGGTCGACTCCTGTTGCCTTGGGGGAAAGGGACGGGAGGTCTCTGACTAGGATCGGTCC 

S S * 

CACCTTAGGAGTATAGGGAGGCCCCATATAGGTCCATCCTCCTAGGATGAAGCCTTCTGA 

1741 + + + - + + : + 1800 

GTGGAATCCTCATATCCCTCCGGGGTATATCCAGGTAGGAGGATCCTACTTCGGAAGACT 

GAGCTTGGTGAAGGTGTCTCCATCACCACCACCAGAGCCTCCTGCCCAGCCCTGGCCAGT 

1801 + + + + + + 1860

CTCGAACCACTTCCACAGAGGTAGTGGTGGTGGTCTCGGAGGACGGGTCGGGACCGGTCA 

TCAAAGGTTCAAGCCATCCCTGCCCTTGTTCTCCCTGCAACCCAAGCCCTGCCATTCTTC 

1861 + . + + + + + 1920 

AGTTTCCAAGTTCGGTAGGGACGGGAACAAGAGGGACGTTGGGTTCGGGACGGTAAGAAG 

TGTCTAGCCCTTCCCCACTGGCCAACTTCCCCCACTGTCCCGGTCCTCTTCCCCTGAGGT 

1921 : + + + + + + 1980 

ACAGATCGGGAAGGGGTGACCGGTTGAAGGGGGTGACAGGGCCAGGAGAAGGGGACTCCA 

CCCCTGATATCCCCTGGCTCAGTCCTAACAAGACTGAGTCTTAACAAGATGAGAAGTCCT 

1981 + + + + + + 2040 

GGGGACTATAGGGGACCGAGTCAGGATTGTTCTGACTCAGAATTGTTCTACTCTTCAGGA 

CCCCTTCTTGCCTCCCACACTTTTCTTTGATGGGAGGTTTCAATAAACAGCGATAAGAAC 

2041 + + + + + + 2100 

GGGGAAGAACGGAGGGTGTGAAAAGAAACTACCCTCCAAAGTTATTTGTCGCTATTCTTG 

TCTAAAAAAAAAAAAAAAAAA 

2101 + + - 2121 

AGATTTTTTTTTTTTTTTTTT 
Bffi 2 0 2004






CAAATTATTTCTTACGTGACTTTAGAGAAAACGGCTACCTATCTGACCCCAAAACGACTT 



1 + + + + + + 60

GTTTAATAAAGAATGCACTGAAATCTCTTTTGCCGATGGATAGACTGGGGTTTTGCTGAA

GAGGAAACTGTTTCCACGGTCCTGCTGCAGGGGGGAAGCACAGTCGTCAAGAAGAGAGTG 
61 + +■ + + --- + + 120 

CTCCTTTGACAAAGGTGCCAGGACGACGTCCCCCCTTCGTGTCAGCAGTTCTTCTCTCAC 

GGGTCAGGATCAAAACACATTTAGTGTGACTTAGGGAAAGAAAACATTTTCCCTCTTTGA 

121 + + + + + + 180 

CCCAGTCCTAGTTTTGTGTAAATCACACTGAATCCCTTTCTTTTGTAAAAGGGAGAAACT 

ACCTCTCTGGATACAGTCATTTTGCCTCTACTTGAGGATCAACTGTTCAACCTCAATGGC 

181 + + + + + + 240

TGGAGAGACCTATGTCAGTAAAACGGAGATGAACTCCTAGTTGACAAGTTGGAGTTACCG 

M A - 

CTTTCAGGACCTCCTGGGTCACGCTGGTGACCTGTGGAGATTCCAGATCCTTCAGACTGT 

241 + + + + + + 300 

GAAAGTCCTGGAGGACCCAGTGCGACCACTGGACACCTCTAAGGTCTAGGAAGTCTGACA 

FQDLLGHAGDLWRFQILQTV-

TTTTCTCTCAATCTTTGCTGTTGCTACATACCTTCATTTTATGCTGGAGAACTTCACTGC 

301 + + + + + + 360 

AAAAGAGAGTTAGAAACGACAACGATGTATGGAAGTAAAATACGACCTCTTGAAGTGACG 

FLS IFAVAT Y LHFM LENFTA- 

ATTCATACCTGGCCATCGCTGCTGGGTCCACATCCTGGACAATGACACTGTCTCTGACAA 

361 . + + + + + + 420 

TAAGTATGGACCGGTAGCGACGACCCAGGTGTAGGACCTGTTACTGTGACAGAGACTGTT 

F I PG H R CWVH I LDNDTV S DN 

TGACACTGGGGCCCTCAGCCAAGATGCACTCTTGAGAATCTCCATCCCACTGGACTCAAA 

421 + + + + + + 480 

ACTGTGACCCCGGGAGTCGGTTCTACGTGAGAACTCTTAGAGGTAGGGTGACCTGAGTTT 

DTGALSQDALLRI SI PLDSN- 

CATGAGGCCAGAGAAGTGTCGTCGCTTTGTTCATCCTCAGTGGCAGCTCCTTCACCTGAA 

481 + + + + + + 540 

GTACTCCGGTCTCTTCACAGCAGCGAAACAAGTAGGAGTCACCGTCGAGGAAGTGGACTT 

MRPEKCRRFVHPQW QLLHLN- 

TGGGACCTTCCCCAACACAAGTGACGCAGACATGGAGCCCTGTGTGGATGGCTGGGTGTA 
541 + + + + + + 600 

ACCCTGGAAGGGGTTGTGTTCACTGCGTCTGTACCTCGGGACACACCTACCGACCCACAT 
GTFPNTSDADMEPCVDGWVY 
TGACAGAATCTCCTTCTCATCCACCATCGGTGACCTGAAGTGGGATCTGGTATGTGACTC
601 + + + + + + 660 

ACTGTCTTAGAGGAAGAGTAGGTGGTAGCCACTGGACTTCACCCTAGACCATACACTGAG 

DRISFSSTIGDLKWDLVCDS

TCAATCACTGACTTCAGTGGCTAAATTTGTATTCATGGCTGGAATGATGTTGGGAGGCAT 

661 + + + + + + 720

AGTTAGTGACTGAAGTCACCGATTTAAACATAAGTACCGACCTTACTACAACCCTCCGTA 

QSLTSVAKFVFMAGM MLGGI - 

CTTAGGCGTTCATTTATCAGACAGGTTTGGGAGAAGTTTCGTGCTCAGATGGTGTTACCT 

721 + + + + + + 780 

GAATCCGCAAGTAAATAGTCTGTCCAAACCCTCTTCAAAGCACGAGTCTACCACAATGGA 

LGVHL SDRFGRSFVLRWCYL - 

CCAGGTTGCCATTGTTGGCACTTGTGCAGCGTTGGCTCCCACTTTCCTCATTTACTGCTC 

781 + + + -- + + - + 840 

GGTCCAACGGTAACAACCGTGAACACGTCGCAACCGAGGGTGAAAGGAGTAAATGACGAG 

QVAIVGTCAALAPTFLIYCS-

AGTACGCTTCTTGTCTGGGATTGCTGCAATGAGCTTCATAACAAATACTATTATGTTAAT 

841 + + --- + + + + 900 

TCATGCGAAGAACAGACCCTAACGACGTTACTCGAAGTATTGTTTATGATAATACAATTA

VRFLSGIAAMSFITNTIMLI- 

AGCCGAGTGGGCAACACACAGATTCCAGGCCATGGGAATTACATTGGGAATGTGCCCTTC 

901 + + + + + + 960 

TCGGCTCACCCGTTGTGTGTCTAAGGTCCGGTACCCTTAATGTAACCCTTACACGGGAAG

AEWATHRFQAMGITLGMCPS

TGGTATTGCATTTATGACCCTGGCAGGCCTGGCTTTTGCCATTCGAGACTGGCATATCCT 

961 + + + + + + 1020 

ACCATAACGTAAATACTGGGACCGTCCGGACCGAAAACGGTAAGCTCTGACCGTATAGGA 

GIAFMTLAGLAFAIRDWHIL-

CCAGCTGGTGGTGTCTGTACCATACTTTGTGATCTTTCTGACCTCAAGTTGGCTGCTAGA 

1021 + + + + + + 1080 

GGTCGACCACCACAGACATGGTATGAAACACTAGAAAGACTGGAGTTCAACCGACGATCT 

QLVVS VP YFVI FLT S SWLLE 

GTCTGCTCGGTGGCTCATTATCAACAATAAACCAGAGGAAGGCTTAAAGGAACTTAGAAA 

1081 -- + + + ---- + + + 1140 

CAGACGAGCCACCGAGTAATAGTTGTTATTTGGTCTCCTTCCGAATTTCCTTGAATCTTT 

SARWL I INNKPEEGLKELRK- 
AGCTGCACACAGGAGTGGAATGAAGAATGCCAGAGACACCCTAACCCTGGAGATTTTGAA

1141 - + + + ; + + + 1200 

TCGACGTGTGTCCTCACCTTACTTCTTACGGTCTCTGTGGGATTGGGACCTCTAAAACTT 

A A H R S GMKNAR DTLT L E I L K - 

ATCCACCATGAAAAAAGAACTGGAGGCAGCACAAAAAAAAAAACCTTCTCTGTGTGAAAT 

1201 + + + + + + 1260 

TAGGTGGTACTTTTTTCTTGACCTCCGTCGTGTTTTTTTTTTTGGAAGAGACACACTTTA

STM KKELEA AQKKKPSLCE M- 

GCTCCACATGCCCAACATATGTAAAAGGATCTCCCTCCTGTCCTTTACGAGATTTGCAAA 

1261 + + + + + + 1320 

CGAGGTGTACGGGTTGTATACATTTTCCTAGAGGGAGGACAGGAAATGCTCTAAACGTTT 

LHMPNICKRISLLSFTRFAN-

CTTTATGGCCTATTTTGGCCTTAATCTCCATGTCCAGCATCTGGGGAACAATGTTTTCCT 

1321 + + + + + + 1380 

GAAATACCGGATAAAACCGGAATTAGAGGTACAGGTCGTAGACCCCTTGTTACAAAAGGA 

FMAYFGLNL HVQHLGNNVFL- 

GTTGCAGACTCTCTTTGGTGCAGTCATCCTCCTGGCCAACTGTGTTGCACCTTGGGCACT 

1381 + + + + - + --+ 1440 

CAACGTCTGAGAGAAACCACGTCAGTAGGAGGACCGGTTGACACAACGTGGAACCCGTGA 

LQTLFGAVILLANCVAPWAL-

GAAATACATGAACCGTCGAGCAAGCCAGATGCTTCTCATGTTCCTACTGGCAATCTGCCT

1441 + + + + + + 1500 

CTTTATGTACTTGGCAGCTCGTTCGGTCTACGAAGAGTACAAGGATGACCGTTAGACGGA 

KYMNRR ASQMLLMFLLAI CL- 

TCTGGCCATCATATTTGTGCCACAAGAAATGCAGACGCTGCGTGAGGTTTTGGCAACACT 

1501 + + + + + + 1560

AGACCGGTAGTATAAACACGGTGTTCTTTACGTCTGCGACGCACTCCAAAACCGTTGTGA 

LA I I FVP QEMQTLR EVLATL- 

GGGCTTAGGAGCGTCTGCTCTTGCCAATACCCTTGCTTTTGCCCATGGAAATGAAGTAAT 

1561 + + + + + + 1620 

CCCGAATCCTCGCAGACGAGAACGGTTATGGGAACGAAAACGGGTACCTTTACTTCATTA 

GLGAS ALANTLAFAHGNEVI 

TCCCACCATAATCAGGGCAAGAGCTATGGGGATCAATGCAACCTTTGCTAATATAGCAGG 

1621 + + + + + + 1680

AGGGTGGTATTAGTCCCGTTCTCGATACCCCTAGTTACGTTGGAAACGATTATATCGTCC 

PTI IRARAMGINATFANIAG- 
AGCCCTGGCTCCCCTCATGATGATCCTAAGTGTGTATTCTCCACCCCTGCCCTGGATCAT

1681 + + + + + + 1740

TCGGGACCGAGGGGAGTACTACTAGGATTCACACATAAGAGGTGGGGACGGGACCTAGTA 

ALAPLMMILSVYSPPLPWI I - 

CTATGGAGTCTTCCCCTTCATCTCTGGCTTTGCTTTCCTCCTCCTTCCTGAAACCAGGAA 

1741 - + + + + + - + 1800 

GATACCTCAGAAGGGGAAGTAGAGACCGAAACGAAAGGAGGAGGAAGGACTTTGGTCCTT

YGVFPFISGFAFLL LPETRN- 

CAAGCCTCTGTTTGACACCATCCAGGATGAGAAAAATGAGAGAAAAGACCCCAGAGAACC 

1801 + + + + + + 1860

GTTCGGAGACAAACTGTGGTAGGTCCTACTCTTTTTACTCTCTTTTCTGGGGTCTCTTGG 

K P L F D T I QDEKNERKD PREP- 

AAAGCAAGAGGATCCGAGAGTGGAAGTGACGCAGTTTTAAGGAATTCCAGGAGCTGACTG 

1861 + + + + + + 1920 

TTTCGTTCTCCTAGGCTCTCACCTTCACTGCGTCAAAATTCCTTAAGGTCCTCGACTGAC 

KQEDPRVEV TQ F * 

CCGATCAATGAGCCAGATGAAGGGAACAATCAGGACTATTCCTAGACACTAGCAAAA 

1921 + + + + + 1977 

GGCTAGTTACTCGGTCTACTTCCCTTGTTAGTCCTGATAAGGATCTGTGATCGTTTT 
CTCCTGATAGCAAAAGAACTGAGGAAGCTCTTTCCACTACGGCTGTATTGCACTGGTGAG
1 + + + + + + 60

GAGGACTATCGTTTTCTTGACTCCTTCGAGAAAGGTGATGCCGACATAACGTGACCACTC 



TCCGGGCCCATGGATGAGAAATTGATGCGAGGATCAATACAAGCTTAATTTGAATTAATA 

61 + + + + + + 120

AGGCCCGGGTACCTACTCTTTAACTACGCTCCTAGTTATGTTCGAATTAAACTTAATTAT 

AAAGGAAATATTTTCTCCCTTTGAACTTATCTCCGTAAAGCCATTGTGCCTCCTCTTGGG 

121 + + + + + + 180 

TTTCCTTTATAAAAGAGGGAAACTTGAATAGAGGCATTTCGGTAACACGGAGGAGAACCC 

GGTCACGTGTTCACAATCAATGGCCTTTGAGGAGCTCTTGAGTCAAGTTGGAGGCCTTGG

181 + + + + + + 240 

CCAGTGCACAAGTGTTAGTTACCGGAAACTCCTCGAGAACTCAGTTCAACCTCCGGAACC 

MAFE E L L S QVG GL G - 

GAGATTTCAGATGCTTCATCTGGTTTTTATTCTTCCCTCTCTCATGTTATTAATCCCTCA 

241 + + + + + + 300

CTCTAAAGTCTACGAAGTAGACCAAAAATAAGAAGGGAGAGAGTACAATAATTAGGGAGT 

R F QMLH LVF IL P S L M L L I P H

TATACTGCTAGAGAACTTTGCTGCAGCCATTCCTGGTCATCGTTGCTGGGTCCACATGCT 

301 + + + + + + 360

ATATGACGATCTCTTGAAACGACGTCGGTAAGGACCAGTAGCAACGACCCAGGTGTACGA 

IL L ENFAAA I PG H R C WVHM L -

GGACAATAATACTGGATCTGGTAATGAAACTGGAATCCTCAGTGAAGATGCCCTCTTGAG 

361 + + + + + + 420

CCTGTTATTATGACCTAGACCATTACTTTGACCTTAGGAGTCACTTCTACGGGAGAACTC 

DNNTGSGNETGILSEDALLR-

AATCTCTATCCCACTAGACTCAAATCTGAGGCCAGAGAAGTGTCGTCGCTTTGTCCATCC 

421 + + + + + + 480

TTAGAGATAGGGTGATCTGAGTTTAGACTCCGGTCTCTTCACAGCAGCGAAACAGGTAGG 

I S I PLDSNLRP EKCRRFVHP- 

CCAGTGGCAGCTTCTTCACCTGAATGGGACTATCCACAGCACAAGTGAGGCAGACACAGA 

481 + + + + + + 540

GGTCACCGTCGAAGAAGTGGACTTACCCTGATAGGTGTCGTGTTCACTCCGTCTGTGTCT 

QW Q L L H LN G T I H S T S E A DT E

ACCCTGTGTGGATGGCTGGGTATATGATCAAAGCTACTTCCCTTCGACCATTGTGACTAA

541 + + + + + + 600

TGGGACACACCTACCGACCCATATACTAGTTTCGATGAAGGGAAGCTGGTAACACTGATT

PCVDGWVYDQSYFPSTIVTK-

GTGGGACCTGGTATGTGATTATCAGTCACTGAAATCAGTGGTTCAATTCCTACTTCTGAC

601 + + + + + + 660

CACCCTGGACCATACACTAATAGTCAGTGACTTTAGTCACCAAGTTAAGGATGAAGACTG 

WDLVCDYQSLKSVVQFLLLT-

TGGAATGCTGGTGGGAGGCATCATAGGTGGCCATGTCTCAGACAGGTTTGGGCGAAGATT

661 + + + + + + 720

ACCTTACGACCACCCTCCGTAGTATCCACCGGTACAGAGTCTGTCCAAACCCGCTTCTAA 

GMLVGG I IGGHVS DRF GRRF 

TATTCTCAGATGGTGTTTGCTCCAGCTTGCCATTACTGACACCTGCGCTGCCTTCGCTCC 

721 + + + + + + 780

ATAAGAGTCTACCACAAACGAGGTCGAACGGTAATGACTGTGGACGCGACGGAAGCGAGG 

ILRWC LLQL A I TD TCAAFAP-

CACCTTCCCTGTTTACTGTGTACTACGCTTCTTGGCAGGTTTTTCTTCCATGATCATTAT 

781 + + + + + + 840

GTGGAAGGGACAAATGACACATGATGCGAAGAACCGTCCAAAAAGAAGGTACTAGTAATA 

TFPVYCVLR FLAGFSSMI I I 

ATCAAATAATTCTTTGCCGATTACTGAGTGGATAAGGCCCAACTCTAAAGCCCTGGTAGT 

841 + + + + + + 900 

TAGTTTATTAAGAAACGGGTAATGACTCACCTATTCCGGGTTGAGATTTCGGGACCATCA 

SNNSLPITEWI RPNSKALVV- 

AATATTGTCATCTGGTGCCCTTAGTATTGGACAGATAATCCTGGGAGGCTTGGCTTATGT 

901 + + + + + + 960

TTATAACAGTAGACCACGGGAATCATAACCTGTCTATTAGGACCCTCCGAACCGAATACA 

ILSSG ALSI GQI I LGGLAYV-

CTTCCGAGACTGGCAAACCCTGCACGTGGTGGCGTCAGTACCTTTCCTTGGCCTCCTTCT 
961 + + + + + + 1020 

GAAGGCTCTGACCGTTTGGGACGTGCACCACCGCAGTCATGGAAAGGAACCGGAGGAAGA 

F R DW QT LHVVA S V P F L G LLL -

CCTTCAAAGGTGGCTGGTGGAATCTGCTCGGTGGTTGATAATCACCAATAAACTAGATGA 

1021 + + + + + + 1080 

GGAAGTTTCCACCGACCACCTTAGACGAGCCACCAACTATTAGTGGTTATTTGATCTACT 

LQRWL VESARWLI I TN KLDE-

GGGCTTAAAGGCACTTAGAAAAGTTGCACGCACAAATGGAATAAAGAATGCTGAAGAAAC 

1081 + + + + + + 1140 

CCCGAATTTCCGTGAATCTTTTCAACGTGCGTGTTTACCTTATTTCTTACGACTTCTTTG

G LKALRKVARTNG I KNAE ET -

CCTGAACATAGAGGTTGTAAGATCCACCATGCAGGAGGAGCTGGATGCAGCACAGACCAA

1141 + + + + + + 1200

GGACTTGTATCTCCAACATTCTAGGTGGTACGTCCTCCTCGACCTACGTCGTGTCTGGTT 

LNIEVVRST MQEE LDAAQTK 

AACTACTGTGTGTGACTTGTTCCGCAACCCCAGTATGCGTAAAAGGATCTGTATCCTGGT 

1201 + + + + + + 1260 

TTGATGACACACACTGAACAAGGCGTTGGGGTCATACGCATTTTCCTAGACATAGGACCA 

TTVCDLFRNPSMRKRICI LV-

ATTTTTGAGATTTGCAAACACAATACCTTTTTATGGTACCATGGTCAATCTTCAGCATGT

1261 + + + + + + 1320 

TAAAAACTCTAAACGTTTGTGTTATGGAAAAATACCATGGTACCAGTTAGAAGTCGTACA 

FLRFANTI PFYGTMV NLQHV-

GGGGAGCAACATTTTCCTGTTGCAGGTACTTTATGGAGCTGTCGCTCTCATAGTTCGATG 

1321 + + + + + + 1380

CCCCTCGTTGTAAAAGGACAACGTCCATGAAATACCTCGACAGCGAGAGTATCAAGCTAC 

G SN I F LLQVLY GAVAL IV RC

TCTTGCTCTTTTGACACTAAATCATATGGGCCGTCGAATAAGCCAGATATTGTTCATGTT 

1381 + + + + + + 1440

AGAACGAGAAAACTGTGATTTAGTATACCCGGCAGCTTATTCGGTCTATAACAAGTACAA 

LALLTLNHMGRRI SQILFMF - 

CCTGGTGGGCCTTTCCATTTTGGCCAACACGTTTGTGCCCAAAGAAATGCAGACCCTGCG 

1441 + + + + + + 1500 

GGACCACCCGGAAAGGTAAAACCGGTTGTGCAAACACGGGTTTCTTTACGTCTGGGACGC 

LVGLS I L ANTFVPKEMQTLR

TGTGGCTTTGGCATGTCTGGGAATCGGCTGTTCTGCTGCTACTTTTTCCAGTGTTGCTGT 

1501 + + + + + + 1560

ACACCGAAACCGTACAGACCCTTAGCCGACAAGACGACGATGAAAAAGGTCACAACGACA 

VALACLGIGCSAATFSSVAV- 

TCACTTCATTGAACTCATCCCCACTGTTCTCAGGGCAAGAGCTTCAGGAATAGATTTAAC 

1561 + + + + + + 1620

AGTGAAGTAACTTGAGTAGGGGTGACAAGAGTCCCGTTCTCGAAGTCCTTATCTAAATTG 

HFIEL I PTVLRARASGIDLT 

GGCTAGTAGGATTGGAGCAGCACTGGCTCCCCTCTTGATGACCTTAACGGTATTTTTTAC

1621 + + + + + + 1680

CCGATCATCCTAACCTCGTCGTGACCGAGGGGAGAACTACTGGAATTGCCATAAAAAATG 

ASRIGAALAPLLMTLTVFFT

CACTTTGCCATGGATCATTTATGGAATCTTCCCCATCATTGGTGGCCTTATTGTCTTCCT

1681 + + + + + + 1740

GTGAAACGGTACCTAGTAAATACCTTAGAAGGGGTAGTAACCACCGGAATAACAGAAGGA 



TLPWIIYGIFPIIGGLIVFL- 

CCTACCAGAAACCAAGAATCTGCCTTTGCCTGACACCATCAAGGATGTGGAAAATCAAAA 

1741 + + + + + + 1800 

GGATGGTCTTTGGTTCTTAGACGGAAACGGACTGTGGTAGTTCCTACACCTTTTAGTTTT 

LPETKNLPLPDT IKDVENQK- 

AAAAAATCTCAAGGAAAAGGCATAAAAATGATTGCTACACAAAAGTGACCAAATTTTAAG

1801 + + + + + + 1860 

TTTTTTAGAGTTCCTTTTCCGTATTTTTACTAACGATGTGTTTTCACTGGTTTAAAATTC 

KNLKEKA* 

AAGCCTTCATGAGCTGATTGGTGGGGAAATTCAGAAAAAAAAATACAGGAAAAGAACACA

1861 + + + + + + 1920

TTCGGAAGTACTCGACTAACCACCCCTTTAAGTCTTTTTTTTTATGTCCTTTTCTTGTGT 

CCAGAAGGGTTTTTTTCCCTACAACCAGCAAGAACATATATTAGATACATGAATCTCAAT 

1921 + + + + + + 1980

GGTCTTCCCAAAAAAAGGGATGTTGGTCGTTCTTGTATATAATCTATGTACTTAGAGTTA 

TATAATTATGGCATTAATTTGCATTTTATTTCAAAATTAACTTGTGGGGACATGTAATCT 

1981 + + + + + + 2040

ATATTAATACCGTAATTAAACGTAAAATAAAGTTTTAATTGAACACCCCTGTACATTAGA 

CTTGAGCAATCTGATATTTTTGGGAAGTCCTTTAAAAAGTTACAAATTTATCAATAAATT 

2041 + + + + + + 2100

GAACTCGTTAGACTATAAAAACCCTTCAGGAAATTTTTCAATGTTTAAATAGTTATTTAA 

ACTAGTAGATAAGATGATTCAGAAACAAAGGAAAATCACAGAATTAGGATGTGGCTGGCT 

2101 + + + + + + 2160

TGATCATCTATTCTACTAAGTCTTTGTTTCCTTTTAGTGTCTTAATCCTACACCGACCGA

TGGTGTATGAAGCACCATGTGATGAATTCATAAAGTTGCAAAAGTCAAAACAATACTGTA 

2161 + + + + + + 2220

ACCACATACTTCGTGGTACACTACTTAAGTATTTCAACGTTTTCAGTTTTGTTATGACAT

CATGCAACCAGAAATCAAATTAAATCCAGAAATAGAGACCTATATAAATGCATTTAATAC

2221 + + + + + + 2280

GTACGTTGGTCTTTAGTTTAATTTAGGTCTTTATCTCTGGATATATTTACGTAAATTATG 

ATGATACTTTTGACATATTAAGCCATTGGAAAACGGAAGGATTAGATACTTAAATAACAT 

2281 + + + + + + 2340

TACTATGAAAACTGTATAATTCGGTAACCTTTTGCCTTCCTAATCTATGAATTTATTGTA 

TGCTATCTCTTTGTAAATACAGTCACTAAATGATGTTAGTTACTTTTCCATGGTGGAATT 

2341 + + + + + + 2400

ACGATAGAGAAACATTTATGTCAGTGATTTACTACAATCAATGAAAAGGTACCACCTTAA 



FIGURE 6 
Sheet 4 of 5 



TTAATTACTTTTTCTTTGTAATTTTTCTCTCTGTATATTTTAAACAAATAGCTGGTATAG 

2401 + + + + + + 2460

AATTAATGAAAAAGAAACATTAAAAAGAGAGACATATAAAATTTGTTTATCGACCATATC 

TTTACAATATTATAAAGATATTGTTCAAATTGAAGGGCAAAGGCCAGGTTCAGCAATTTT 

2461 + + + + + + 2520

AAATGTTATAATATTTCTATAACAAGTTTAACTTCCCGTTTCCGGTCCAAGTCGTTAAAA 

CAAACTGTATGTACATTTAATAAAATAACTATAAATTAAAAAATTATATTTCAAATGATG 

2521 + + + + + + 2580 

GTTTGACATACATGTAAATTATTTTATTGATATTTAATTTTTTAATATAAAGTTTACTAC 

TGACTAATAAATGAAAGTACATATAGTAGTAAAGTAATTTCAGGCAAACCTATATAACCA 

2581 + + + + + + 2640

ACTGATTATTTACTTTCATGTATATCATCATTTCATTAAAGTCCGTTTGGATATATTGGT 

AAATATAAACTTTCATTTTAAACAGCAAAAAAAAAAAAAAAAAA 

2641 + + + + 2684 

TTTATATTTGAAAGTAAAATTTGTCGTTTTTTTTTTTTTTTTTT

Control OAT1
PAH Transport by EcR293-OAT2A